Joint extraction and prediction of Fujisaki's intonation model parameters

Authors

  • Pablo Daniel Agüero
  • Klaus Wimmer
  • Antonio Bonafonte
Abstract

This paper presents a joint extraction and prediction framework for intonation modeling, applied to Fujisaki’s intonation model for text-to-speech conversion. Previous methods in the area extract the parameters of accent and phrase commands for each sentence and then relate these parameters to linguistic features for prediction. In our approach, commands that share the same linguistic features are estimated globally. This approach is intended to overcome some consistency problems in the extracted model parameters. The global nature of the parameter optimization avoids the interpolation step, which can sometimes bias the extracted parameters. Experimental results show that the higher consistency of the parameters results in higher accuracy when the fundamental frequency contours are predicted.
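For readers unfamiliar with the model the abstract refers to, the following is a minimal sketch of the standard Fujisaki formulation, in which the logarithm of F0 is a base value plus the responses to phrase commands (impulses) and accent commands (pedestals). It only generates a contour from given commands; it does not implement the joint extraction and prediction procedure of the paper. The function names, the default constants (alpha = 2.0, beta = 20.0, gamma = 0.9) and the example commands are illustrative assumptions, not values taken from the paper.

```python
import math


def phrase_response(t, alpha=2.0):
    """Phrase control response: alpha^2 * t * exp(-alpha * t) for t >= 0, else 0."""
    if t < 0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)


def accent_response(t, beta=20.0, gamma=0.9):
    """Accent control response: min(1 - (1 + beta*t) * exp(-beta*t), gamma) for t >= 0."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)


def log_f0(t, fb, phrase_cmds, accent_cmds, alpha=2.0, beta=20.0, gamma=0.9):
    """ln F0(t) = ln Fb + sum Ap*Gp(t - T0) + sum Aa*(Ga(t - T1) - Ga(t - T2)).

    phrase_cmds: list of (Ap, T0) pairs (magnitude, onset time in seconds).
    accent_cmds: list of (Aa, T1, T2) triples (amplitude, onset, offset).
    """
    value = math.log(fb)
    for ap, t0 in phrase_cmds:
        value += ap * phrase_response(t - t0, alpha)
    for aa, t1, t2 in accent_cmds:
        value += aa * (accent_response(t - t1, beta, gamma)
                       - accent_response(t - t2, beta, gamma))
    return value


def f0_contour(times, fb, phrase_cmds, accent_cmds):
    """Return F0 in Hz at each time in `times` (seconds)."""
    return [math.exp(log_f0(t, fb, phrase_cmds, accent_cmds)) for t in times]


if __name__ == "__main__":
    # Hypothetical commands: one phrase command at 0 s, one accent from 0.3 s to 0.6 s.
    times = [i * 0.01 for i in range(200)]
    contour = f0_contour(times, 100.0, [(0.5, 0.0)], [(0.4, 0.3, 0.6)])
    print(round(max(contour), 1))
```

Extraction, in this picture, means finding the command magnitudes and timings that make such a generated contour match a measured one; the abstract's proposal is to estimate them jointly for commands that share linguistic features rather than sentence by sentence.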

Similar resources

Intonation modeling for TTS using a joint extraction and prediction approach

This paper presents a joint extraction and prediction framework for intonation modeling. The intonation model is based on a superpositional approach using Bézier curves. The components are attached to minor phrase and accent group. A greedy algorithm performs successive partitions on training data using linguistic information. The parameters related to each partition are obtained using a global ...

Estimation of the parameters of the quantitative intonation model with continuous wavelet analysis

Intonation generation in state-of-the-art speech synthesis requires the analysis of a large amount of data. Therefore reliable algorithms for the extraction of the parameters of an intonation model from a given F0 contour are required. This contribution proposes improvements concerning the extraction of the parameters of the quantitative intonation model developed by Fujisaki. The improvements ...

New rule-based and data-driven strategy to incorporate Fujisaki's F0 model to a text-to-speech system in Castillian Spanish

We will present the analysis of a Spanish prosody database by estimating the parameters of Fujisaki's model for F0 contours. These parameters are classified according to linguistic features and they form the analysis database. When synthesizing F0 contours we extract the linguistic features from the text and perform a k-Nearest Neighbour search. Linguistic feature comparison distance is trained...

Prediction of intonation patterns of accented words in a corpus of read Swedish news

This paper describes an initial attempt at the construction of a data-driven model of Swedish intonation. The study is mainly concerned with model building and prediction of the intonation patterns of accented words in a corpus of read news in Swedish. Extraction of pitch information is achieved by performing a stylization of the pitch contours. The information is used to build a model for the ...

Prediction of intonation patterns of accented words in a corpus of read Swedish news through pitch contour stylization

This paper describes an initial attempt at the construction of a data-driven model of Swedish intonation. The study is mainly concerned with model-building and prediction of the intonation patterns of accented words in a corpus of read news in Swedish. Extraction of pitch information is achieved by performing a stylization of the pitch contours. The information is used to build a model for the ...

Publication date: 2004